A TWO PARAMETER RATIO-PRODUCT-RATIO ESTIMATOR 
USING AUXILIARY INFORMATION 

PETER S. CHAMI, BERND SING, AND DONEAL THOMAS 

Abstract. We propose a two parameter ratio-product-ratio estimator for a 
finite population mean in a simple random sample without replacement fol- 
lowing the methodology in Ray et al. [3], Sahai et al. [5, 6] and Singh et al. [7]. 

The bias and mean square error of our proposed estimator are obtained 
to the first degree of approximation. We derive conditions for the parameters 
under which the proposed estimator has smaller mean square error than the 
sample mean, ratio and product estimators. 

We carry out an application showing that the proposed estimator outper- 
forms the traditional estimators using groundwater data taken from a geolog- 
ical site in the state of Florida. 



1. Introduction 

We consider the following setting: For a finite population of size N, we are 
interested in estimating the population mean Y of the main variable y (taking 
values t/i for i = 1, . . . , N) from a simple random sample of size n (where n < N) 
drawn without replacement. We also know the population mean X for the auxiliary 
variable x (taking values Xi for i = 1, . . . , N). We use the notation y and x for the 
sample means, which are unbiased estimators of the population means Y and X, 
respectively. 

We denote the population variances of Y and X by 

1 ^ - 1 ^ _ 

5f.=V(y) = ^^^(y.-F)' and 5|=V(X) = ^^^(X,-X)^ 

i=l i=l 

respectively. Furthermore, we define the coefficient of variation of Y and X as 

Cy = ^ and Cx = 
respectively, and the coefficient of correlation C between the two variables as 

where r = g'^g^ denotes Pearson's correlation coefficient. 

As estimator of the population mean Y, the usual sample mean y, the ratio 
estimator yji = (y/x) ■ X and the product estimator yp = (x ■ y) / X are used. 
Murthy [2] and Sahai at al. [5] compared the relative precision of these estimators 



Date: February 14, 2012. 

2010 Mathematics Subject Classification. 62D05; 62G05. 

Key words and phrases. Estimators; Finite population mean; Mean square error; Bias; Relative 
efficiency. 

Corresponding Author. Doneal Thomas, donealtagmail.com. 



2 



PETER S. CHAMI, BERND SING, AND DONEAL THOMAS 



and showed that the ratio estimator, sample mean and product estimator are most 
efficient when C > ^, < C < ^ and C < — ^, respectively. In other words: 
when the study variate y and the auxiliary variate x show high positive correlation, 
then the ratio estimator shows the highest efficiency; when they are highly negative 
correlated, then the product estimator has the highest efficiency; and when the 
variables show a weak correlation only, then the sample mean is preferred. 

For estimating the population mean Y of the main variable, we proposed the 
following two parameter ratio-product-ratio estimator: 



(1) Va../} 



{l-P)x + l3X 
/3S + (1-/3)X 



y+{l-a) 



/3x+(l- (])X' 
{l-P)x + f3X 



V 



where a, 13 are real constants. Our goal in this article is to derive values for these 
constants a,/3 such that the bias and/or the mean square error (MSE) of ya^js is 
minimal. In fact, in Section 5 we are able to use the two parameters a and /3 
to obtain an estimator y*{C) that is (up to first degree of approximation) both 
unbiased and has minimal MSE. 

Note that ya^p — yi^a.i-p, i-e., the estimator ya^is is invariant under a point 
reflection through {a, (3) = In the point of symmetry (a,/3) = (^, |), the 

estimator reduces to the sample mean, i.e., we have y i i = V- In fact, on the 
whole line /? = | our proposed estimator reduces to the sample mean estimator, 
i.e., y^ i = y. Similarly, we get yi^o = yo,i = y) /X = yp (product estimator), 
and yo,o = yi,i = {v ^) /x = yp (ratio estimator). 

2. First Degree Approximation to the Bias 

In order to derive the bias of tja^p up to O (i), we set 

y-Y , x-X 
ei — — = — and 62 = — = — . 
Y X 

Thus, we have y = y(l-i-ei) and x = X {1 + 62), and the relative estimators are 
given by 

y=^==(l + ei) and i = = (1 + 62) . 
Thus, the expectation value of the e^'s is 

E(e,) = for i = 1,2, 
and under a simple random sample without replacement, the relative variances are 



V.e,(^) = I|^=E(e?)=V(e.)='-^^^^ 



Y2 V ^ ^' n \ Y 



2 



V..e>(S) = W=E(^^)-V(e2)-^-^^^- 



2 



and 

X^ ^ ' ' n \X 

where / = ^ is the sampling fraction. Also, we have 

E(eie2) =^^rCYCx, 
n 

see [3, 5, 7]. Furthermore, we note that E {ele^ — O (;^), and E ^e^e^^ = when 
{i + j) is an odd integer. 
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Now re-expressing (1) in terms of e^'s and by substituting x and y, we have 



1 + 62 - /3e2 
l + /3e2 



r(l + ei) + (l-a) 



l + /3e2 
1 + 62 - /3e2 



^(1 + ei). 



In the following, we assume that^ |e2| < min||jj|, jizrgy| so that we can expand 
(1 + /3e2)~^ and (1 + (1 — /3) 62)""'^ as a series in powers of 62 up to O (e^- We get 



ya,0 = (1 + ei) F • [1 - (1 - 2a)(l - 2/3) 62 + (1 



2/3) e2 + 0(e3)] 



we obtain the 



We assume that the sample is large enough to make |e2| so small that contribu- 
tions from powers of 62 of degree higher than two are negligible, compare [7]. By 
retaining powers up to ef, we get 

(2) y„,0 - r « y {ei - (1 + ei) [(1 - 2a)(l - 2^) 62 - (1 - a - /?)(! - 2/3) el] ] 

Taking expectations on both sides of (2) and substituting C 
bias of Tja.p to order O as 

« — (1 - 2/3) [(1 
n 

1-/ 
n 

Equating (3) to zero, we obtain 

1 



/3) - (1 - 2a) r 



, Cy 
Cx 



Cx 



(3) 



(1 - 2/3) [1 - a - ^ - (1 - 2a) C] C| Y 



(4) 



/3 = 



/3 = l- a- C + 2aC. 



The proposed ratio-product-ratio estimator ^q,^^, substituted with the values of 
/3 from (4), becomes an (approximately) unbiased estimator for the population 
mean Y . In the three-dimensional parameter space (a, /3, C) G M'^, these unbiased 
estimators lie on a plane (in the case /3 = i) and on a saddle-shaped surface, see 
Fig. 1 (a). Furthermore, as the sample size n approaches the population size iV, 
the bias of y^^p tends to zero, since the factor (1 — /)/n clearly tends to zero. 

3. Mean Square Error of ya.fi 

We calculate the mean square error of yafi up to order 0(rr^) by squaring (2), 
retaining terms up to squares in e\ and 62, and then taking the expectation. This 
yields the first degree approximation of the MSE 



(5) MSEi {yafi) 



Y^ {Gl + C\(\ - 2a)(l - 2^) [(1 - 2a)(l - 1p) - 2C]} 



1-/ 



Taking the gradient V = ( ^! m ) (5), we get 



-^Y^ C\ [(1 - 2a) (1 - 2/3) - q (1 - 2^3, 1 - 2 a) . 



(6) VMSEi(y,,^) =4 
Setting (6) to zero to obtain the critical points, we obtain the following solutions: 



(7) 



a 



and /3 



1 

2' 



^We note that min | ^ , | attains its maximal value 2 at 
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Figure 1. (a) Surface of "biasfree estimators" defined by (3) in 
the parameter space (a,/3,C) G M.^. (b) Surface of "AOE parame- 
ters" defined by (8) . The points of intersection of the two surfaces 
(see Section 5) are drawn as black curves. 



or 



(8) C= (1- 2a) (1-2/3). 

One can check that the critical point in (7) is a saddle point unless C = 0, in which 
case we get a local minimum. However, the critical points determined by (8) are 
always local minima; for a given C, equation (8) is the equation of a hyperbola 
symmetric through («,/?) — (^, ^). Thus, in the three-dimensional parameter 
space (a,/3,C) £ E^, the estimators with minimal MSE (or better, minimal first 
approximation to the MSE, see calculation in (9) below) lie on a saddle-shaped 
surface, see Fig. 1 (b). 

We now calculate the minimal value of the MSE. Substituting (7) into the esti- 
mator ya,i3, yields the unbiased estimator y (sample mean) of the population mean 
Y . Thus, we arrive at the mean square error of the sample mean: 

MSE (^11)= MSE (y) = ^—^Y^Cl = ^—^Sl 

By substituting (8) into the estimator, an asymptotically optimum estimator (AOE) 
^1°^ is found. For the first degree approximation of the MSE, we find 

(9) MSEi ^ (C2 - d) = (1 - r^) , 



i.e., the same minimal mean square error as found in [3, 5, 6, 7]. 
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4. Comparison of MSB's and Choice of Parameters 

Here we compare MSEi [ya.p) in (5) with the MSB of the product, ratio and 
sample mean estimators respectively. It is known [3, 5] that 



(10) MSB {y) = ¥ (j/) = ^-Y^a 



(11) MSBi {yji) = '-Y'{C^ + C'x{l- 2C)}, 

n 

and 

(12) MSBi (yp) ^ ^—^Y^{C^ + (1 + 2C)}. 



4.1. Comparing the MSE of the product estimator to our proposed es- 
timator. From [3, 5, 6, 7], we know that for C < — ^, the product estimator is 
preferred to the sample mean and ratio estimators. Therefore, we seek a range of a 
and f3 values where our proposed estimator ya^p has smaller MSB than the product 
estimator. 

From (12) and (5), the following expression can be verified: 
MSBi (yp) - MSBi (y„,^) = 4 ^^—^Y^ C\ [1 + 2a/3 - a - /3] [C - (2a/? - a - /?)] 
which is positive if 

(13) [1 + 2a/3 - a - /3] [C - (2a/3 - a -/?)]> 0. 

We obtain the following two cases 

(i) C > 2a/3 — a — P > ~1 (if both factors in (13) are positive), or 

(ii) C < 2a/3 — a — /3 < — 1 (if both factors in (13) are negative). 

Noting that we are only interested in the case C < — 1 , we get from (i) 

> C > 2af3 ~ a - (3 > -I. 

We note that this implies — 1 < C < — 1, and the range for a and /3 where these 
inequalities hold are explicitly given by the following two cases: 

• If ^ < i , then < a < , and 

• if ;9 > 1, then <a< ^0j. 

For any given C, we again note that the two regions determined here are symmetric 
through (a, (3) — |). We also note that the parameters (a, /3) which give an AOB 
(see (8)), which for a fixed C lie on a hyperbola, are contained in these region. 

In the case (ii), where C < — 1 (and therefore automatically C < — ^), the 
following range for a and /3 can be found: 

• If /3 < I , then < a < §0j , and 

• if /3 > then §0- <a< 

The same remark as in the previous case apply. Furthermore, note that for C = — 1, 
the product estimator attains the same minimal MSB as our proposed estimator 
ya,/3 on the hyperbola given by (9). In Fig. 2 (a) we show the region in parameter 
space (a, /3, C) G calculated here and in the next two sections where the proposed 
estimator works better than the three traditional estimators. 
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4.2. Comparing the MSE of the ratio estimator to our proposed estima- 
tor. For C > ^, the ratio estimator is used instead of the sample mean or product 
estimator [3, 5, 6, 7]. As a resuh, we are concern with a range of plausible values 
for a and /3, where j/a,/3 works better than the ratio estimator. 

Taking the difference of (11) and (5), we have 

MSEi (yR) - MSEi (y^,^) = 4 Cj, [2a/? - a - P] [C - 1 - {2ap - a - p)] 

which is positive if 

[2a/3 -a- 13] [C - 1 - (2a^ - a -/?)]> 0. 

Therefore, 

(i) C - 1 > 2a/3 - a - /3 > 0, or 

(ii) C -l<2a(3 - a- P <0. 

Hence, from solution (i), where C > 1, we have 

• If /3 < i, then ^^^j^zr" < " < 

. if/?>i,then5^<a<i^. 
Also, from solution (ii), where | < C < 1, we obtain 

• If /3 < i, then <a < '^2p-i ^ ^^"^ 

• if ;9 > |, then < a < 3^^. 

4.3. Comparing the MSE of the sample mean to our proposed estimator. 

Finally, we compare the MSE (y) to our proposed estimator, MSE (^q, ,g). From 
[3, 5, 6, 7], we know that sample mean estimator is preferred for — ^ < C < ^. 
Taking the difference of (10) and (5), we get: 

MSE (y) - MSEi (y„,^) = ^^Y^C^x (1 " 2a) (1 - 2/?) {2C - (1 - 2a) (1 - 2/3)} 
which is positive if 

(1 - 2a)(l - 2/3) {2C - (1 - 2a)(l - 2/3)} > 0. 

Therefore, either 

(i) a > i, /3 > i and C > ^(1 - 2a)(l - 2/3), or 

(ii) a < i, /3 > i and C < i(l - 2a)(l - 2/3), or 

(iii) a > i, ;9 < i and C < i(l - 2a)(l - 2/3), or 

(iv) a < i, ;3 < i and C > i(l - 2a)(l - 2(5). 

Combining these with the condition — 5 < C < ^, we get the following explicit 
ranges: 

• If < C < i and /3 > i, then i < a < ^f+^g^^^ (from (i)), and 

• if < C < i and /3 < i, then < " < ^ (from (iv)), and 

• if -5 < C < and ;3 > i, then ^^^(tff^i)^ < " < 5 (from (ii)), and 

• if -5 < C < and ;3 < i, then ^ < a < ^^1^^%^ (from (iii)). 

We note that the case C — implies r = 0, and thus the sample mean estimator is 
the estimator with minimal MSE (and, as already noted, y = yi 1). 

In Fig. 2 (a) we show the region in parameter space (a, /3, C) S where the 
proposed estimator y^^p works better than the three traditional estimators. Note 
that the surface of "AOE parameters" in Fig. 1 (b) is a subset of this region, except 
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Figure 2. (a) Region of the parameter space (a,/3, C) G 
where our proposed estimator has lower MSE than the tradi- 
tional estimators, (b) Scatterplot of study and auxiliary variables 
for the groundwater data studied in Section 6. 



for the values C = 0, C = — 1 and C — +1 for which our proposed estimator only 
works as well as^ the sample mean, product and ratio estimator, respectively. 

5. Unbiased AOE 

Combining (4) and (8), we can calculate the parameters a and /3 where our 
proposed estimator becomes - at least up to first approximation - an unbiased 
AOE. We obtain a line with 

(3^^ and C = 0, 

(recall that on this line our estimator always reduces to the sample mean estimator) 
or a "curve" (a* (C), /3* (C), C) e M in the parameter space with 

(14) a*(C)-i |^l±y^^^ and /3* (C) ^ \ (l ± ^C{2C ~ 1)) . 

We note that the parametric "curve" in (14) is only defined for C < or C > 5 - in 
fact, this parametric "curve" is three hyperbolas. The surface of "biasfree estimator 
parameters" in Fig. 1 (a) and the surface of "AOE parameters" in Fig. 1 (b) only 
intersect in these three hyperbolas and the line /3 = 5 and C = 0. In the region 
< C < ^ of the parameter space (a, /3, C) € M.^, we have the common situation 
where minimising MSE comes with a trade-off in bias. The curves of intersection 

^We also remark that the points (1, 1, 1) and (0, 0, 1) (note that j/i i = 5o,0 = 27ii)i (Oj !> ~1) 
and (1, 0, —1) (note that j/i o = 5o i = Vr) as well as the line (a, i^, 0) (note that y„ i = y) belong 
to the surface of "AOE parameters" in Fig. 1 (b). 
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are included in Fig. 1. Explicitly, our proposed estimator using the values in (14) 
is given by 

(15) y*(C) = y„.(c),,3-(c) = 

2 (C + 1) ^2 - 2 (C - 1) x2 + (2 C2 - C - 1) (X - i)' _ 
4X X- {2C'^ - C - 1) {X ~ xf ^' 

At first it might seem surprising that this estimator y* (C) is also defined in'' the 
region < C < |. However, one can also let the parameters (a, /3) in the definition 
of our proposed estimator ya^p in (1) be complex numbers - but such that we still 
get a real estimator. One can check that cx*{C) and /3*(C) in (14) for < C < ^ 
have this property. 

Furthermore, we can check that the first degree of approximation of the bias and 
MSE of y*(C) are given by 

Bi(z/*(C))=0 and MSEi (f (C)) = Sl (l ~ r^) 

n ' 

(compare (3) and (9)). Thus, the estimator y*{C) of (15) is an unbiased AOE. 

One might also ask whether inside < C < ^ there is a choice of real parameters 
(a,/3) G such that we get an AOE with small bias. Using (8) in (3), we get the 
first degree approximation of the bias of an AOE 

Bi {yi%) = ^Ci? ^ [C (1 - 2 C) + (1 - 2/3)2] 

From this expression (and the constraint (8)) it is clear that the bias can only be 
made zero if C < or C > ^. Otherwise, there is always a positive contribution 
coming from the term C(l — 2C) that does not vanish no matter what we choose 
for /3 e M. In fact, it looks as if the choice f3 = always yields the least possible 
bias, however two remarks are in order here: Firstly, given (8) and unless C = 0, 
we can only let /3 be close to ^ and choose a accordingly (the absolute value |a| is 
then large). Secondly, we already noted that y^ i — y, and the MSE for the sample 

mean estimator is ^—^Sv, not ^—^Sv(^ — r^) as for an AOE. We have arrived here 
at a point where the first degree approximation to bias and MSE breaks down. 
To find a choice of real parameters for given C with minimal MSE and least bias, 
higher degrees of approximation would have to be considered. 

6. Application and Conclusion 

Using data taken from the Department of the Interior, United States Geological 
Survey (USGS) [9], site number 02290829501 (located in Florida), a comparison 
of our proposed estimator y^,^ to the traditional estimators was carried out. The 
study variables (denoted by Y) are taken to be the maximum daily values (in feet) 
of groundwater at the site for the period October 2009 to September 2010. The 
auxiliary variables (denoted X) are taken as the maximum daily values (in feet) 
of groundwater for the period October 2008 to September 2009. Our goal is to 
estimate the true average maximum daily groundwater Y for the period October 
2009 to September 2010. 
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The questions we ask are: How many units of groundwater must be taken from 
the population Y to estimate the population mean Y within d = 10% at a 90% 
confidence level (a = 0.10)? And how well do the estimators perform given this 
data set with auxiliary information for the calculated sample size n? 

Using the entire data set, we calculate the following statistics: Y 0.5832, 
X « 0.6277, Sy ~ 0.4480, Sx ~ 0.7222, r « 0.9125, Cy ~ 0.7681, Cx ~ 1-1504 
and C « 0.6092. A scatterplot of the data set is shown in Fig. 2 (b), which adds 
emphasis to the positive measure of association between the study variable Y and 
the auxiliary variable X. 

One should note that the value of C sa 0.6092 lies in the interval (5,1), so 
we choose values of a and /3 from Section 4.2. Indeed, we use (14) and choose 
/? = /3* (0.6092) « 0.3176 and a = a* (0.6092) w -0.3349. Note that /3 = 0.3176 
yields (-0.8704) < a < 0.200573 in Section 4.2. Using the notation of Section 5, 
we also note that j/-o. 3349, 0.3176 — (0.6092). 

6.1. Calculating the sample size n. To estimate the population mean amount 
of groundwater recorded for the state of Florida from October 2009 to September 
2010, a sample of size n is drawn from the population of size N — 365 according 
to the simple random sampling without replacement [1]. A first approximation to 
this sample size needed is the (infinite population) value 



where d is the chosen margin of error from the estimate of y , and Zs. is a standard 
normal variable with tail probability of ^ . Accounting for the finite population size 
TV, we obtain the sample size 



1 




In general, the true value of is unknown but can be estimated using its consistent 
estimator . However, in our case is calculated from the population and is given 
as Si w 0.2006. Therefore, with a = 0.10, d = 10% of F (i.e., d « 0.0583) and 
■^0.05 ~ 1.6449, the sample size can be calculated as no ~ (i-6449)^^-o.2006 ^ 159.59^ 
rounding up gives ng — 160. Therefore, we get n sa 1 \ 1 « 111.23, and thus 

leO + 365 

take n = 112. 

6.2. Relative efficiencies. Table 1 shows the relative efficiencies of the traditional 
estimators (sample mean ratio ya and product yp estimators) and our proposed 
two parameter ratio-product-ratio estimator ija^p for (a,/3) = (—0.3349,0.3176). 
We note that with this choice of parameters, the estimator is an (unbiased) AOE, 
namely y-o. 3349.0.3176 — (0.6092). The table shows that our two parameter ratio- 
product-ratio estimator dominates the traditional estimators in the sense that it 
has the highest efficiency. 

We can also observe that in the computation of the relative efficiency, the speci- 
fication of the sample size n is not important since the finite population correction 
factor {^—^) cancels out (however, this would not be the case for higher degrees of 
approximation) . 
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Table 1. Relative efSciencies comparisons. 



MSE(y) 


MSEfe) 


MSE(y) 


MSE(y) 


MSE(y) 


MSEi(yH) 


MSEi(ap) 


MSEi(j/* (0.6092)) 


100% 


196.11% 


16.73% 


597.28% 



6.3. Constructing a 90% confidence interval for Y using ya,i3- Constructing 
a 90% confidence interval, the following formulation can be used (similar formulae 
hold for all estimators discussed here), see [1]: 

The factor {N — n)/ [N — 1) is the finite population correction. 

Of course, by the choice of the sample size n = 112, we get a margin of error of 
approximately 0.1 ~ 0.0583 here; more precisely, the calculation using the above 
formula yields (y* (0.6092) ± 1.6449 • 0.0423 • 0.8337) = (y* (0.6092) ± 0.0580). 

6.4. Comparison of estimators. To compare the proposed estimator with the 
traditional ones, we selected 10 000 times a sample of size n = 112 and calculate 
the estimators from it. We note that there are i^^^^ ~ 2.5 • 10^® possibilities to 
choose 112 data points out of a total 365 without replacement. 

In Table 2 we show the relative position of the estimators with respect to the pop- 
ulation mean Y. In the 10 000 simulations, our proposed estimator outperformed 
the traditional estimators on 4 701 occasions. The ratio estimator, the suggested 
estimator for this value of C by [2], performs better than our proposed estimator 
2 978 times (in these cases it is actually the best of the studied estimators; note 
that the ratio estimator is the worst 1 349 times). 

In Table 3, we compare the estimators by looking at the following criteria: The 
coverage probability is the proportion of the 90% confidence interval covering the 
population mean Y] as expected, the usual mean sample estimator yields around 
90%, while the ratio estimator and our proposed estimator yield much higher values 
- in this simulation, all intervals calculated from our proposed estimator cover Y. 
For those 90% confidence intervals that do not cover Y, we check whether they lie 
to the left {negative bias) or to the right {positive bias) of Y. We also state the 
statistical information lower and upper quartile and median, we get from the 10 000 
simulations; directly below this we show violin plots for the estimators (the dashed 
line indicates the value F, the dotted lines indicate the 90% confidence interval). 
In the violin plot, we see that the values obtained by our proposed estimators yield 
a narrow normal distribution around the true value (skewness is 0.0046, kurtosis is 
2.9926), while the product estimator gives a spread-out distribution and the (tradi- 
tionally preferred) ratio estimator gives a skewed distribution (skewness is 0.5230, 
kurtosis is 3.4253). Finally, we compare the values of the MSEs; the experimental 
values obtained agree with the theoretical values listed in Table 1. 

We infer that our proposed estimator is more efficient and robust than the tra- 
ditional sample mean, ratio and product estimators. 

6.5. Outlook. Several authors have proposed efficient estimators using auxiliary 
information. For example, Srivastava and Reddy [8, 4] consider a generalisation to 
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Table 2. Comparison of the estimators according to the absolute 
deviation from the population mean Y (in 10 000 simulations). 



position 


counts 




y* (0.6092) 


-Y 


< 




Vr-Y 




< 




\y-Y\ 




< 


Vp 


- Y 




3173 




Vr-Y 




< 


r (0.6092) 


- Y 


< 




\y-Y\ 




< 


yp 


- Y 




2 978 




y* (0.6092) 


-Y 


< 




\y-Y\ 




< 




yR-Y\ 




< 


yp 


- Y 




1528 




y-Y\ 




< 


y* (0.6092) 


- Y 


< 




yR-Y\ 




< 


Vp 


-Y 




972 




Vp-Y 




< 




\y-Y\ 




< 


y* (0.6092) - 


Y 


< 


Vr 


-Y 




766 




y-Y\ 




< 


y* (0.6092) 


- Y 


< 




yp-Y\ 




< 


Vr 


-Y 




308 




y-Y\ 




< 




yp-Y 




< 


y* (0.6092) - 


Y 


< 


Vr 


-Y 




275 



Table 3. Comparison of the estimators in 10 000 simulations. See 
text for details. 



estimator 


coverage 


neg. bias pos. bias 


lo. quart, median up. quart. 


y 

yp 
yp 

y* (0.6092) 


89.86% 
97.01% 
49.59% 
100.00% 


5.14% 5.00% 
0.12% 2.87% 
25.44% 24.97% 
0.00% 0.00% 


0.5583 0.5820 0.6062 
0.5670 0.5831 0.6017 
0.5240 0.5809 0.6410 
0.5732 0.5829 0.5928 



estimator 


^^^^^ MSE(ost) 


y 

yp 
yp 

y* (0.6092) 


0.00127 100% 
0.00068 186.68% 
0.00755 16.79% 
0.00021 598.93% 



0.3 0.4 0.5 0.6 0.7 0.8 0.9 



the product and ratio estimator given by y'^'^^ = y (x/X) ; Reddy [4] also intro- 
duces the estimator y^ = y X / (^X + k [x — X)); in Sahai et al. [5] the estimator 
ykt = y (2 — (S/-?)'') (where "t" stands for "transformed") is considered; and 
Singh et al. [7] introduce a certain class of "ratio-product" estimators given by 
yRp{k) — y (k ■ [X/x) + (1 — fc) • {x/X)Y Choosing appropriate parameters k for 
these estimators and calculating the first degree approximation of the MSE, one 
can show that 

MSEi (y(-^)) = MSEi (yc) = MSEi {yet) = MSEi (yflp(^^)) 

= 1^5f. (1 - r2) . 
n 

Thus, these estimators and our proposed estimator (see (9)) are equally efficient 
up to the first degree of approximation. Indeed, all these estimators give similar 
results as our proposed estimator in the above application, see Table 4. Comparing 
the first degree of approximation of the bias (doing calculations as in Section 2) 
reveals why our unbiased AOE y*{C) and Reddy's yc behave similar - they are 
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Table 4. Comparison of AOEs in 10 000 simulations. See text for details. 



estimator 


coverage 


neg. bias 


pos. bias 


lo. quart. 


median 


up. quart. 


y* (0.6092) 


100% 


0% 


0% 


0.5732 


0.5829 


0.5928 


y(-0.6092) 


100% 


0% 


0% 


0.5738 


0.5835 


0.5935 


yo.6092 


100% 


0% 


0% 


0.5732 


0.5829 


0.5928 


yo.6092 t 


100% 


0% 


0% 


0.5719 


0.5816 


0.5915 


yflp(0.8046) 


99.99% 


0% 


0.01% 


0.5749 


0.5850 


0.5953 



estimator 


MSE 


(0.6092) 

^(-0.6092) 

yo.6092 
yo.6092 t 

yflP (0.8046) 


0.000212 598.93% 
0.000214 593.91% 
0.000212 598.92% 
0.000215 590.51% 
0.000228 555.09% 



yc 
yc I 



-EE- 



0.54 0.5fl C.aa O.Qfl 0.fl2 0.fl4 



both unbiased AOEs: 

M{y) = Bi (r (C)) = Bi (yc) - 0, Bi (m) = ^(1 - C)Cj,y, 

n 

M,{yp)-'^CCj,y, Bi(y('^))=i^^(i-^Ciy, 
n \ / n 2 

n 2 n 2 

(With C — 0.6092, only yet is negatively biased, compare the quartiles and the 
box plot in Table 4). For our proposed estimator ya.f3, we are able to use the 
two parameters a and (3 to obtain an estimator y*{C) that is up to first degree 
of approximation both unbiased and has minimal MSE. A thorough comparison of 
these estimators involving higher degrees of approximation of MSE and bias as well 
as accompanying simulations might be desirable, e.g., to find the estimator that 
behaves well if the coefficient of correlation C is not known exactly. 
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